# High-accuracy transcription

GigaAM v2 is an automatic speech recognition (ASR) model for Russian speech-to-text, available in both CTC and RNN-T variants.

Speech Recognition Other

Whisper Small Turkish 0

A Turkish speech recognition model fine-tuned from OpenAI Whisper-small

Speech Recognition

Transformers Other

Kotoba Whisper V2.2

A Japanese automatic speech recognition model based on Whisper, with integrated speaker diarization and punctuation restoration

Speech Recognition

Transformers Japanese

Whisper Large V2 Ko

A Korean automatic speech recognition (ASR) model fine-tuned from OpenAI Whisper-large-v2 that performs well on Korean datasets

Speech Recognition

Transformers Korean

Whisper Large V2 Hausa

This model is a fine-tuned version of OpenAI's Whisper Large-V2 for Hausa speech recognition tasks, trained on the Common Voice 11.0 dataset

Speech Recognition

Transformers Other

Stt Kr Conformer Transducer Large

This is a large Korean automatic speech recognition model based on the Conformer-Transducer architecture, trained on the KsponSpeech dataset and suited to Korean speech transcription.

Speech Recognition Other

S2t Medium Librispeech Asr

A speech-to-text (S2T) model for automatic speech recognition (ASR), based on a sequence-to-sequence transformer architecture

Speech Recognition

Transformers English

Wav2vec2 Large Xls R 300m Assamese Cv8

This is an automatic speech recognition (ASR) model fine-tuned from facebook/wav2vec2-xls-r-300m on Assamese data

Speech Recognition

Transformers Other

Wav2vec2 Base 10k Voxpopuli Ft Hr

A speech recognition model based on Facebook's Wav2Vec2 architecture, pretrained on the VoxPopuli corpus and fine-tuned on Croatian data

Speech Recognition

Transformers Other

Wav2vec2 Base 10k Voxpopuli Ft Nl

A speech recognition model based on Facebook's Wav2Vec2 architecture, pretrained on the 10K-hour unlabeled subset of the VoxPopuli corpus and fine-tuned on Dutch transcription data.

Speech Recognition

Transformers Other

Wav2vec2 Punjabi Stt

This is a Punjabi speech recognition model based on the Wav2Vec2 architecture, capable of converting Punjabi speech into text.

Speech Recognition

© 2025 AIbase